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Executive  Summary 

This  report  summarizes  our  research  and  development  work  in  the  last  quarter  of  year  2008.  We  have  reached  a 
practical  limit  in  terms  of  sampling  rate  and  bandwidth.  Listed  below  are  major  achievements  during  the  quarter: 

•  improved  waveform  generator:  FPGA  coding  with  respect  to  readability,  quantization  at  the  transmitter  has 
been  increased  from  ternary  to  8-bit,  and  sampling  rate  has  been  increased  from  500  Ms/s  to  1  Gs/s; 

•  I/Q  quadrature  modulation/upconversion,  instead  of  single-channel; 

•  Xilinx  Virtex  5  LXT  board  at  the  receiver  with  new  high  speed  connection  to  the  A/D; 

•  variable  thresholds  for  various  signal  levels; 

•  10-dB  Bandwidth  up  to  800  MHz; 

•  multiple  bit  rates:  6.25,  3.125,  and  1.5625  Mb/s; 

•  improved  receiver  sensitivity. 


Our  next  milestone,  a  gain  of  several  dB  over  ordinary  non -time-reversal  system,  is  anticipated,  and  system  trials 
will  be  reported  in  the  April  report. 
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Chapter  1 


Unique  System  Design  Issues 

1.1  Passband  Waveform  Precoding 


A  flexible  way  to  generate  passband  signal  is  to  use  a  dual-channel  baseband  waveform  generator  followed  by  an  I/Q 
up-converter  (modulator).  In  this  way,  the  spectral  shape  can  be  easily  controlled  by  changing  the  local  oscillator’s 
frequency  and  the  baseband  waveforms.  Considering  the  nature  of  the  energy  detector  receiver,  the  goal  of  transmit 
waveform  precoding  is  to  achieve  a  desired  signal  envelope  at  the  input  to  the  detector.  If  w(t)  and  h(t)  are  the 
baseband  representations  of  the  transmit  chip  waveform  and  the  channel  impulse  response  (C1R),  respectively,  then 
the  transmit  waveform  and  the  CIR  can  be  expressed  as 

w(t)  =  Re{w{t)e~i2**ct}y  (1.1) 

h(t)  =  Re{h(t)e-i2*f'1},  (1.2) 


where  fc  is  the  center  frequency.  It  can  be  proved  that  the  received  chip  waveform  is  given  by 


y(t)  =  w(t)  <g>  h(t) 

=  Re{[Uv(t)®h(t))e-}2*f't} 

=  Re{y(t)e-^t},  (1.3) 


where  is  convolution  operator  and  y{t)  =  \w(t)  <g>  h(t)  is  the  baseband  representation  of  the  received  chip 
waveform  without  noise  pollution.  In  other  words,  the  transmitter-receiver  chain  can  be  represented  equivalently  in 
baseband  as  illustrated  in  Fig.  1 . 1,  where  the  factorof  1/2  (see  above  equation)has  been  absorbed  into  h(t).  Note  that 
h(t)  takes  into  account  the  impacts  of  propagation  channel  and  RF  front-ends  including  antennas  at  both  sides. 


1.2  Estimation  of  the  Equivalent  Filter  in  the  Transmitter-Receiver  Chain 


The  equivalent  baseband  filter  in  the  transmitter-receiver  chain  has  to  be  taken  into  account  in  order  to  have  optimum 
waveform  at  the  receiver  detector  This  filter  is  generally  not  flat  over  the  frequency  band  of  interest,  especially  for 
wide  bandwidth  case.  Estimating  and  controlling  the  equivalent  baseband  filter  is  not  easy.  As  shown  in  Fig.  1 .2,  the 
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n(t) 

Base-band  system  representative  with  energy  detector  receiver. 
(Tt  is  the  integration  window  size) 


Figure  1.1;  Baseband  system  equivalence 


Receiver 


Figure  1.2;  Equivalent  filter  in  transmitter-receiver  chain  (not  including  the  propagation  channel),  contributed  from  three 
sources. 


overall  filter  effect  is  contributed  from  three  sources:  the  D/A’s  interpolating  filter  p(t)  and  RF  front-ends  including 
antennas  at  both  sides.  By  omitting  some  scalar  factor,  the  equivalent  filter  in  baseband  format  is 

F(t)  =  F0(t)®F1(t)®F2(t),  (1.4) 

and  Fo(t)  can  be  expressed  using  the  interpolating  filter  p(t) 

Fo(t)  =  p(t)  +j  p(t).  (1.5) 

Accordmg  to  our  measurement,  the  shape  of  the  equivalent  filter  is  dominated  by  the  interpolating  filter  p(t).  Based 
on  the  A/D  used  in  the  testbed,  p(t)  has  been  accurately  measured  and  Fo(t)  =  p(t)  +  j  •  p(t,)  has  been  used  as  an 
estimate  of  F(t). 


1.3  Discrete-Time  Representation 


Practically  speaking,  precoding  is  performed  in  the  digital  back-end  and  the  baseband  waveform  should  be  repre¬ 
sented  in  digital  format  (discrete-time  and  quantized)  by  at  sampling  rate  I/T5.  Similarly,  h(t)  can  be  rep¬ 
resented  in  discrete-time  by  hn  at  sampling  rate  M\/Ts ,  M\  >  1.  It  is  assumed  that  the  sampling  rate  I/X5  is 
determined  by  the  hardware  limitation,  and  the  sampling  rate  M\/Ts  is  no  less  than  the  Nyquist  rate  based  on  h(t). 
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Figure  1.3:  Continuous-time  waveforms  generated  from  discrete-time  waveforms  (sequences). 

Fig.  1.3  is  a  simplified  diagram  showing  the  continuous-time  transmit  waveform  w(t)  and  receive  waveform  y(t) 
originate  from  the  digital  waveform  xm.  It  is  necessary  to  find  how  y(t)  is  calculated  from  sequences  xm  and  fin. 

Let  us  first  generate  an  analytical  up-sampling  signal  xC,  corresponding  to  arm: 


/  _  j  x  i 
Cm/  \  ( 


m,  if  m!  —  M\m 
0,  otherwise 


(1.6) 


so  thatrc^,  and  hn  have  the  same  sampling  rate  Mi/Tg.  By  using  the  ideal  interpolating  function  sinc(M\t/Ts ), 
the  target  waveform  y(t)  can  be  expressed  as 


y(t) 


=  El 

Let  v  —  r  —  m!Ts/M\  and  define 

q(t)  =  p(t)  0  sinc(M\t/Ts), 

then  y(t)  can  be  further  expressed  as 

y(t)  =  YLYLx'™,hn  [  p{y)sinc(Ml\ 1  ~~  ("»'  +  n))  dv 

m'  n  J~°°  \  lS  / 

+  n)Ts 


(1.7) 


(1.8) 


,  (.  ( m'  +  n 

~  Z2z2xm’,ln'l  (  <  M 

m'  n  x 

=  S  X]  xmlhk-m‘<l  (*■  —  ^ 

km *  '  ' 

=  Y,^2Xmhk  mMi <i ’ 


(1.9) 


k  m 


where  k  =  mf  4*  n  has  been  applied.  It  is  worth  noting  that  in  the  above  equation  both  rr^,  and  hn  are  in  complex 
format,  xmhk  m\fi  represents  discrete-time  convolution,  and  q(t)  is  a  non-ideal  interpolating  function. 

The  target  waveform  can  also  be  represented  based  on  the  equivalent  CIR  h(t )  defined  as 

h(t)  =  p(£)0/i(£) 

=  Y  h„sinc  -  n 


6 


CHAPTER  1 .  UNIQUE  SYSTEM  DESIGN  ISSUES 


where  hn  is  a  discrete-time  version  of  h(t)y  sampled  at  the  Nyquist  rate  (or  above  that)  M2/T5.  Define  as 


if  m!f  = 

0,  otherwise 


then  y(t)  can  be  expressed  in  a  similar  form: 


2/(0 


EE*™"^  -m"sinc 

k  m" 


EE  StflC 

k  m 


V*5 


(1.10) 


(111) 


1.4  Balanced  I/Q  Quantization 

xm  is  actually  a  digitized  complex  signal  with  certain  quantization  resolution.  To  make  full  use  of  the  A/D’s  quan¬ 
tization  capability,  it  is  desired  to  balance  the  peak  values  in  the  real  and  image  parts,  and  use  the  peak  value  as  the 
D/A’s  full  scale.  When  energy  detector  receiver  is  used,  the  waveform  precoding  is  to  achieve  a  desired  envelope 
Vopt{t)  at  the  receiver  detector.  If  this  balance  requirement  is  not  met,  we  can  always  equalize  the  two  peaks  by 
applying  phase  rotation  to  the  original  waveform  xo,m  which  is  in  complex  format. 

x0,m  =  *0 ,m+j-x0,m-  (112) 

If  there  is  no  waveform  overlapping  between  successive  chips,  after  phase  rotation  by  an  angle  of  6 ,  the  new  wave¬ 
form  is 


1 0 

—  *^0,771 

=  xI0,mcosd  +  J  ■  x0,msine 
=  xm+j  xm- 

(1.13) 

By  solving  the  following  equation  for  6 : 

max{\xQ >m|}cos0  =  max [\x^  m\} sinO , 

(114) 

9  is  given  by 

„  1  f  maX{\X0,rn\}\ 

y  771(2x{|j"q  m|}  J 

(1.15) 

When  waveform  overlapping  between  successive  chips  is  inevitable,  assume  the  Xo  m  is  a  composition  of  successive 
chip  waveforms  before  phase  rotation: 


then  the  rotation  angle  is 


-^0,771 


X1 

A  0,771 


6  =  tan 


(  maj{|Aqm|}\ 


(1.16) 


(1.17) 


Chapter  2 


Implementation  Aspect 


8 


CHAPTER  2.  IMPLEMENTATION  ASPECT 


2.1  Arbitrary  Waveform  Generator 


Digital  Arbitrary  Waveform  Generator  (AWG)  is  the  main  part  in  the  backend  at  the  transmitter.  Digital  AWG  can 
generate  virtually  any  type  of  transmitted  waveform  according  to  our  objective,  which  greatly  increases  the  capability 
of  UWB  system.  Compared  with  our  first  generation  real-time  UWB  testbed.  Digital  AWG  in  the  current  generation 
real-time  UWB  testbed  is  significantly  improved.  Specifically  speaking,  the  quantization  resolution  is  increased 
from  2  bits  to  8  bits.  The  sampling  rate  of  output  digital  data  is  increased  from  500MHz  to  1  GHz.  The  number 
of  channels  is  increased  from  1  to  2.  Thus  I/Q  data  can  be  generated  simultaneously.  The  number  of  waveform 
coefficients  is  increased  from  40  to  160.  Data  memory  is  added  into  Digital  AWG.  Data  memory  plus  8  groups  of 
shift  registers  are  used  to  store  the  waveform  coefficients.  In  this  way,  the  loading  of  waveform  coefficients  into 
FPGA  is  much  easier  than  before.  Look-up  table  is  replaced  by  Boolean  operation  and  mathematical  computation 
which  can  support  the  calculation  of  input  data  with  higher  quantization  resolution. 

How  can  we  use  FPGA  with  550MHz  maximum  clock  rate  to  generate  the  output  digital  data  with  1GHz  sampling 
rate?  Parallel  to  Serial  Converter  gives  us  a  hope.  The  overall  structure  of  digital  AWG  for  one  channel  is  shown 
in  Figure  2.1 .  Waveform  coefficients  are  stored  in  Data  Memory  When  FPGA  powers  on,  waveform  coefficients 
can  be  automatically  loaded  from  Data  Memory  to  the  corresponding  group  of  shift  registers.  There  are  8  groups  of 
shift  registers.  And  each  group  of  shift  registers  corresponds  to  one  Process  Module.  The  clock  rate  of  each  group 
of  shift  registers  plus  the  corresponding  Process  Module  is  125  MHz.  Finally  Parallel  to  Serial  Converter  is  used  to 
generate  the  output  digital  data  with  1GHz  sampling  rate. 

For  each  channel,  we  have  8  groups  of  shift  registers  and  each  group  can  store  20  waveform  coefficients  shown  in 
Figure  2.2.  So  totally  we  can  have  160  waveform  coefficients  for  each  channel. 

In  the  first  generation  real-time  UWB  testbed,  we  used  look  up  table  in  Process  Module.  Now  we  use  the  multiplica¬ 
tion  and  addition  shown  in  Figure  2.3.  In  this  way,  the  Verilog  code  is  more  compact  and  readable.  The  calculation 
of  input  data  with  higher  quantization  resolution  can  be  supported. 

Format  Regulation  uses  Boolean  operation  to  generate  complementary  code  for  the  following  processing  in  Process 
Module  according  to  the  scrambling  code  generated  by  Scrambling  Code  Generator  and  the  value  of  input  chip. 

The  key  module  in  digital  AWG  is  the  high  speed  Parallel  to  Serial  Converter,  the  structure  of  which  is  shown  in 
Figure  2.4.  Through  this  way,  the  high  speed  requirement  of  the  output  digital  data  is  relaxed  by  parallel  computation 
and  Parallel  to  Serial  Converter.  The  function  of  parallel  to  serial  conversion  is  implemented  by  OSERDES  logical 
resources  in  Virtex-5  FPGA.  Each  OSERDES  logical  component  can  support  6  to  1  parallel  to  serial  conversion,  but 
we  need  8  to  1  parallel  to  serial  conversion,  so  two  components  are  used  to  build  one  converter.  One  component  is 
called  master  and  one  other  is  slave.  The  function  diagram  is  shown  in  Figure  2.5.  In  our  design,  the  sampling  rate 
of  input  data  from  8  parallel  branches  is  1 25  MHz  and  sampling  rate  of  the  output  digital  data  is  1  GHz. 
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Figure  2.1:  Overall  structure  of  digital  AWG  for  one  channel. 


Figure  2.2:  Digram  of  shift  registers. 
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Figure  2.3:  Strucuture  of  Process  Module. 
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Figure  2.4:  Strucuture  of  Parallel  to  Serial  Converter. 
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Figure  2.5:  Strucuture  of  OSERDES  expansion. 
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Figure  2.6:  Interfacing  DAC  evaluation  board  and  FPGA  evaluation  board. 


2.2  DAC 


The  high  speed  interface  between  the  DAC  and  FPGA  is  a  critical  issue  in  the  testbed.  The  interface  quality  of  our 
last  testbed  could  only  meet  single  channel,  500  Msps  and  2 -bit  quantization.  This  configuration  is  far  below  the 
maximum  performance  provided  by  Fujitsu  DAC,  featuring  dual  channel,  1  Gsps  and  14-bit  quantization.  In  order 
to  fully  use  the  Fujitsu  DAC,  we  found  a  interfacing  solution,  with  the  Xilinx  ML550  FPGA  evaluation  board,  an 
interface  adapter  built  by  Fujitsu  and  a  blue  ribbon  cable  (Fig.  2.6). 

ML550  provides  6  banks  of  LVDS  I/O  (a  total  of  53  differential  input  and  53  differential  output),  supporting  up 
to  1.2  Gbps  high  data  rate  transmission.  All  the  I/Os  are  synchronized.  The  maximum  data  rate  is  limited  by  the 
LX50T  FPGA,  which  is  just  enough  to  drive  the  l  Gsps  DAC.  If  we  want  to  drive  a  even  faster  DAC,  to  the  best 
of  our  knowledge,  parallel  to  serial  conversion  is  built  inside  the  DAC  chip,  resulting  a  reduced  burden  at  the  data 
driving  side.  For  example,  the  MAXIM  4.3  Gsps  DAC  uses  a  4:1  MUX  to  drive  the  DAC,  and  therefore  each  of 
the  4  data  channels  requires  a  1.075  Gsps  sampling  rate  only.  As  a  result,  the  ML550  I/O  interface  is  capable  of 
driving  the  highest  sampling  rate  DAC.  The  blue  ribbon  cable  is  a  high  speed  cable  provided  by  Samtec.  It  supports 
up  to  2.84  Gbps  data  rate,  which  is  far  beyond  the  our  maximum  data  rate  requirement.  The  interface  adapter  is 
customized  by  Fujitsu.  It  connects  the  Fujitsu  DAC  and  the  ribbon  cable.  The  pin  assignments,  a.k.a.  mappings 
between  the  Fujitsu  DAC  input  ports  and  FPGA  output  ports  are  tested  and  built  as  a  user  constraint  file.  It  can  be 
loaded  in  the  FPGA  design  and  drive  the  DAC  properly. 

The  purpose  of  the  interface  is  to  fully  utilize  the  performance  of  the  DAC,  which  is  the  dual  channel,  1  Gsps  and 
14-bit  quantization.  A  test  environment  is  built  to  test  the  performance.  All  DAC  outputs  are  driven  from  FPGA 
with  the  interface  solution.  FPGA  driven  waveforms  are  compared  with  the  waveform  memory  module  driven  ones, 
since  the  latter  ones  are  considered  to  be  correct  Fig.  ??  shows  the  two  FPGA  driven  ramp  waveforms  generated 
by  DAC  ports  A  and  B.  We  can  see  that  two  ports  are  synchronized  and  the  waveforms  are  the  same  as  the  ones 
generated  by  waveform  memory.  A  14-bit  mono-cycle  1  ns  pulse  is  generated,  as  shown  in  Fig.  2.7.  The  pulse  width 
is  over  1  ns  and  there  is  a  tail  behind  the  pulse.  This  might  be  caused  by  the  circuit  of  the  DAC  evaluation  board. 
The  spectrum  of  the  mono-cycle  pulse  is  depicted  in  Fig.  2.8.  Another  result  is  that  the  lower  9-14  bits  of  the  DAC 
are  under  the  noise  level  and  can  not  be  observed.  So,  the  configuration  of  the  DAC  becomes  dual  channel,  1  Gsps 
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and  8-bit  quantization,  which  is  a  great  improvement  to  the  previous  one. 


Figure  2.7:  A  1  ns  pulse  generated  by  DAC. 


Frequency  GHz 


Figure  2.8:  The  spectrum  of  a  1  ns  pulse  generated  by  DAC. 
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Chapter  3 


Testbed  Configuration  Issues 


For  our  test-bed  development,  we  have  a  lot  of  FPGA  and  DAC  configuration  files  for  different  working  cases,  the 
following  part  serves  as  a  memorandum  for  the  test-bed’s  configurations. 

3.1  FPGA  configuration  files  at  transmitter  side 

The  test-bed’s  tansmitter  configuration  files  mainly  include  ”ML550_SPIXO”  and  ’TirJdL550”  : 

•  ML550_SPIXO. 

Descriptions;  This  is  the  SPI  protocol  control  project  files  folder  at  transmitter  side. 

Functions: 

(1) ,it  generates  a  serial  data  streams  output  to  configure  the  PSA4000A  local  oscillator  to  work  at  exactly  4.0  Ghz. 

(2) ,  Provide  500MHz  output  clock  for  DAC  board,  this  is  useful  when  we  only  use  DAC  memory  to  transmit  time 
reversal  waveform  to  observe  the  focused  peak  at  receiver. 

Sources  files  list' 

(1) ,  spiJo.v,  it  serves  as  the  main  module. 

(2) ,  pll.v,  it  is  used  to  provide  proper  working  clock  signals. 

(3) ,  spi  Jo.ucf,  which  defines  the  input  and  output  pins  assignment. 

Pins  input  and  output: 

NET  ”le”  LOC  =  ”AK32”;  connected  to  P9-8  on  FPGA  board 
NET  ”mosi”  LOC  =  ”AH32”;  connected  to  P9-6  on  FPGA  board 
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NET  ”sclk”  LOC  =  ”AK33”;  connected  to  P9-4  on  FPGA  board 
NET  ”led_show”  LOC  =  ”AJ7”; 

NET  ’’waveform-clk^OO-p”  LOC  =  ”AD10”; 

NET  ”waveform_clk_500_n”  LOC  =  ”AD1 1”; 

key  parameters: 

(1) ,  The  main  working  clock  is  4MHz,  which  works  for  SPI  control  signals  generation. 

(2) ,  The  output  clock  for  DAC  board  is  500MHz,  where  a  differential  buffer  ”‘OBUFDS”’  is  used. 

(3) ,  The  4  SPI  configuration  registers  value  can  be  changed  to  configure  LO  works  at  other  frequencies  form 
3975MHz  to  4025MHz,  where  register  ”‘NJLatch”’  is  the  critical  one,  the  related  information  can  be  found  in 
ADF4106  frequency  synthesizer  data  sheet. 

Notes:  Each  time  the  local  oscillator  is  powered  off,  it  should  be  configured  again  when  power  on  next  time. 

•  Tir_ML550. 

Descriptions:  This  is  the  main  project  files  folder  in  transmitter. 

Functions: 

(1 ) , Output  transmitted  digital  data  frame  by  frame,  the  frame  structure  can  be  found  in  references. 

(2)  it  provides  500MHz  clock  signal  for  DAC  when  FPGA  are  used  for  transmitting. 

(3) ,  it  generates  synchronization  signals  for  digital  oscilloscope  or  other  equipments  in  receiver  side. 

Sources  files  list: 

(1 ) ,  OOK_Tx_ML550_V8.v.  this  is  the  top  module  at  transmitter  side. 

(2) ,  sampler. v.  this  module  is  used  to  store  transmitted  data  in  RAM  and  output  the  frame  to  encoder  module. 

(3)  encoder. v.  this  is  the  function  module  for  channel  encoding,  it  is  empty  currently. 

(4) ,  modulator.l  .v.  it  is  a  function  block  for  modulation  consisting  of  pulse  repetition  and  OOK  modulation. 

(5) ,  ss.v.  this  is  the  block  for  spread  spectrum,  it’s  empty  currently. 

(6) ,  pulser.v.  it  is  used  to  generate  short  pulses  or  chips  at  certain  chip  rate. 

(7) ,  ramJoad_v2.v,  this  is  the  module  for  storing  waveform  source  data,  and  to  be  feed  to  waveform  generator 
module. 

(8) ,  wavefonn_generator_v2.v,  this  is  the  function  module  used  to  generate  waveforms  need  based  on  transmitted 
chips  and  stored  waveform  sources. 
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(9) ,  oserdes81_8bit.v,  this  is  the  8  to  1  parallel  to  serial  conversion  module,  and  the  data  width  is  8  bits.  (9), 
oserdes81.v,  this  is  the  submodule  for  oserdes8L8bit.v,  the  conversion  is  1  bit. 

(10) ,  pll.v,  it  the  clock  management  module  used  to  provide  various  clock  signals  for  other  modules. 

(11) ,  OOK_Tx_ML550_V8.ucf,  this  is  the  top  constraint  file  at  transmitter  side,  it  defines  the  input  and  output  pins 
assignment  and  apply  various  timing  constraints  for  the  project. 

pins  input  and  output: 


NET  ”board_clk.p”  LOC  =  ”H17”; 

NET  ”board_clk_n”  LOC  =  ”H18”; 

NET  ”rst_sw”  LOC  -  ”W34”; 

NET  ’ ’wave form_c lk_500_p”  LOC  =  ”AD10”; 

NET  ” wave  form_clk-5  00 _n’ ’  LOC  =  ”AD1 1”; 

NET  ”system_clk_  125  .sync”  LOC  =  ”J10”; 

NET  ”system.clk_l  25-sync”  IOSTANDAR D  -  ’’LVCMOS25”; 
NET  ’ ’wave form-data  J_p[0] ’ ’  LOC  =  ”AG28”; 

NET  ’’waveform_data_l_n[0]’’  LOC  -  ”AH28”; 

NET  ’’waveform_data_Lp[  1]’’  LOC  =  ”AD24”; 

NET  ’’waveform_data_I_n[  1  ]”  LOC  -  ”AE24”; 

NET  ”waveform_dataJ-p[2]”  LOC  -  ”AE27”; 

NET  ”waveform_dataJ_n[2]”  LOC  =  ”AE26”; 

NET  ”waveform_dataJ_p[3]”  LOC  =  ”AC25”; 

NET  ”waveform_dataJ_n[3]”  LOC  -  ”AC24”; 

NET  ”waveform_dataJ_p[4]”  LOC  =  ”AB27”; 

NET  ’’wave form_dataJ_n[4]”  LOC  =  ”AC27”; 

NET  ’ ’wave form_data J -_p[ 5] ”  LOC  =  ”AB25”; 

NET  ’ ’wave form_dataJ _n[5]”  LOC  =  ”AB26”; 

NET  ”waveform_dataJ-p[6]”  LOC  =  ”AB28”; 

NET  ”waveform_dataJ_n[6]”  LOC  =  ”AA28”; 

NET  ”waveform_dataJ_p[7]”  LOC  =  ”Y24”; 

NET  ”waveform-dataJ-n[7]”  LOC  =  ”AA24”; 

NET  ’’wave form_data_Q_p[0]”  LOC  =  ”AD30”; 

NET  ”waveform_data_Q_n[0]”  LOC  =  ”AC29”; 

NET  ”waveform_data_Q_p[  1  J”  LOC  -  ”AF29”; 

NET  ”waveform_data_Q-n[l]”  LOC  =  ”AF30”; 

NET  ’ ’wave form_data_Q_p [ 2]”  LOC  -  ’T26”; 

NET  ”waveform_data_Q-n[2]”  LOC  =  ”W26”; 

NET  ’ ’wave form_data_Q.p [3]”  LOC  =  ”Y28”; 

NET  ”wavefonn_data_Q_n[3]”  LOC  =  ’T29”; 

NET  ”waveform_data_Q_p[4J”  LOC  =  ”V25”; 

NET  ”wavefonn_data_Q-n[4]”  LOC  =  ,rW25”; 

NET  ’ ’wave form_data_Q_p [ 5] ”  LOC  -  ”V30”; 

NET  ”waveform-data_Q-n[5]”  LOC  =  ,TW30”; 

NET  ”waveform_data_Q_p[6]”  LOC  =  ,TW24”; 

NET  ”waveform-data-Q_n[6]”  LOC  =  ,7V24”; 

NET  ’ ’wave form_data-Q_p [ 7] ”  LOC  =  ’T27”; 
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NET  ’’waveform_data_Q_n[7]”  LOC  =  ”W27”; 


Key  parameters: 

(1),  The  key  parameters  in  the  transmitter  is  the  waveform  sources  in  module  ”‘ramJoad_v2.v’”,  it  is  loaded  in  the 
RAMs,  there  are  two  RAMs  in  this  module,  one  is  used  to  store  I  channel  data,  and  the  other  Q  channel  data,  the 
data  length  is  currently  up  to  1 60ns,  data  width  is  8  bits,  the  data  format  can  be  shown  as  following,  where  there  are 
5  lines  ,  each  line  32  points,  we  can  either  transmit  160  points  time  reversal  waveforms  or  just  transmit  2ns  pulse. 
The  data  order  starts  form  the  right  8  bits  of  the  first  line(”‘09’”  in  the  following  example”’),  after  ends  in  the  left 
side,  it  starts  from  the  right  side  the  second  line(”409’”  in  the  following  example’”)  and  likewise.  These  data  are 
automatically  read  out  and  feed  to  other  modules  when  the  project  is  run. 

.INIT.00(256’hl41404FA02EDE20705FDEEF40813FCF408000B0FF8F7F5F6FC0C04E7EBFBF009), 

.INIT.O 1  (256  ’hD4E4 1 5FAFF03E6CAD5B0E629F202 1 8FE 1 2 1 5 1 E320A 1 D2AFE24 1 D 1 2 1 3 1 C20F9 1 1 ), 

.  INIT_02(256  ’h00000000000000000000000000000000 1 02 1 4D3 7530 AF942CFF2 1 9EE9C  ABC  1 C2 ), 

.INIT_03(256’h0000000000000000000000000000000000000000000000000000000000000000), 

.^^.04(256^0000000000000000000000000000000000000000000000000000000000000000), 


(2) ,  In  modulator.l.v  module,  we  can  change  the  60  bits  frame  synchronization  codes  between  OOC  or  M  sequence. 

(3) ,  In  waveform_generator_v2.v  module,  we  send  200ns  dense  pulses  for  start  pattern  searching  before  each  frame, 
and  this  part  can  be  modified.  The  format  is  controlled  by  the  control  signals  ”‘tt_w’”  and  ”‘t_w’”,where  the  first 
one  represents  the  number  of  symbols(1024)  for  each  frame  and  the  later  one  represents  the  number  of  40  ns  of  each 
chip. 

(4) ,  The  DAC  sampling  rate  is  controlled  by  the  clock  signal  ’waveform_clk_500”  which  is  500MHz,  if  we  change 
this  clock  rate,  other  clocks  rate  should  be  changed  accordingly,  the  recommendation  is  to  change  the  overall  param¬ 
eters  in  pll.v  module  by  modifying  the  multiplication  factor  for  all  output  clocks. 

(5) ,  Change  between  using  4  chips  or  16  chips  to  represent  one  bit.  this  can  be  changed  in  modulator.  1  .v  module. 
Notes:  more  details  can  be  found  in  file  ’’‘log.doc’”. 


3.2  FPGA  configuration  files  at  receiver  side 

•  OOKJlx.Synch_V5.TiR. 

Descriptions  This  is  the  main  project  files  folder  in  receiver. 

Functions: 

(l),it  receives  digital  data  streams  from  ADC  board  at  high  speed  of  800  Mb/s,  the  data  width  is  8  bits  with  both 
primary  channel  and  auxiliary  channel. 


(2),  Synchronizes  the  received  frame. 
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(3),  Demodulates  the  data  bits  sent  in  the  transmitter. 

Sources  files  list. 

(1) ,  OOK_Rx_Synch_V80_top.v.  This  is  the  top  module  in  receiver. 

(2) ,  Interface. l.v.  this  is  the  interface  module  which  receive  data  streams  from  ADC,  and  reduce  the  high  speed 
data  to  relatively  low  speed  data  streams  through  techniques  such  as  FIFO,  DDR,  serial  to  parallel  conversion, 
submodules  includes  HSRX  16D_2T01.v,  DDR_REG.v,  fifo_64to64.v,  DDR_REG.v,  BUFJLUT.v  and  pllsecond.v. 

(3) ,  integration_7.v.  this  is  the  integration  module,  it  implements  chip  level  energy  integration  by  using  32  parallel 
DSP48E  based  integrators,  the  results  are  stored  in  32  registers  with  each  corresponds  to  an  integrator,  it  includes 
two  submodules  accumulator^. v,  accu_v7  Jp_500Jse9.v. 

(4) ,  start_pattem.v.  this  moudle  is  used  to  search  the  arrival  of  the  frame  by  comparing  the  integration  results  with 
certain  threshold. 

(5) ,  synch  _searching_l.v.  This  is  the  synchronization  module,  which  combines  the  time  of  arrival  signal  from  start 
pattern  searching  and  the  integration  results  to  get  a  chip  threshold  by  averaging  128  chips,  then  it  synchronize  the 
60  bits  M  sequence  data,  after  that  the  whole  frame  is  synchronized,  it  includes  a  submodule  Maximun_6.v. 

(6) ,  data_process_l.v.  this  module  accumulates  4  chips  integration  result  from  integration  module,  the  result  means 
each  received  data  value  since  we  represent  each  bit  by  4  chips. 

(7) ,  detection. v.  this  is  the  detection  module  which  compares  the  threshold  achieved  in  synchronization  module  and 
the  data  value  from  data  process  module,  and  the  result  is  the  demodulation  result,  which  should  be  the  data  bits  sent 
in  transmitter. 

(8) ,  FSM.  v,  this  is  the  finite  state  machine  module,  which  coordinate  the  implementation  order  of  all  other  process. 

(9) ,  Variable_threshold4.v,  this  is  the  variable  threshold  module,  which  can  change  the  threshold  value  by  push 
buttons  on  FPGA  board,  in  includes  4  thresholds  currently,  the  start  pattern  threshold,  the  chip  threshold  ratio,  the 
frame  level  synchronization  threshold  and  the  ADC  input  bias  shift. 

(10) ,  pllmain.v.  this  is  the  clock  management  module  which  provides  various  clock  signals  for  all  modules. 

Pins  input  and  output: 

NET  ”board.clk”  LOC  =  El  8; 

NET  ’’data-out”  LOC  =  El 3; 

NET  ’Tst-sw”  LOC  =  AD  14; 

NET  ’’REFCLKN”  LOC  =  D8; 

NET  ’’REFCLKP”  LOC  =  C9; 

NET  ’’synch-flag-l”  LOC  =  H26; 

NET  ”RXN _data^A_0”  LOC  =  F8; 

NET  ”RXP_data^A_0”  LOC  -  F7; 

NET  ”RXN _data^A_  1  ”  LOC  =  G9; 

NET  ”RXP-data^A_l”  LOC  =  F9; 

NET  ”RXN_data-A_2”  LOC  =  J8; 
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NET  ”RXP_data_j\_2”  LOC  =  H8; 
NET  ”RXN_data_A_3”  LOC  =  A8; 
NET  ”RXP_dataJ\_3”  LOC  =  A9; 
NET  ”RXP_data.A_4”  LOC  =  E8; 
NET  ”RXN_data_A_4”  LOC  =  E7; 
NET  ”RXN.data_A.5”  LOC  =  C8; 
NET  ’’RXP.data _A_5”  LOC  =  B9; 
NET  ”RXP_data_A_6”  LOC  =  E6; 
NET  ”RXN_data_A_6”  LOC  =  D6; 
NET  ”RXN_data_A_7”  LOC  =  G7; 
NET  ”RXP_data_A_7”  LOC  =  H7; 
NET  ’’RXN.data  _P_0”  LOC  =  C6; 
NET  ”RXP_data_PJ)”  LOC  =  C7; 
NET  ”RXP_data_P_l”  LOC  =  A7; 
NET  ”RXN_data_P_l”  LOC  =  B7; 
NET  ”RXN_data_P_2”  LOC  =  DIO; 
NET  ”RXP_data_P_2”  LOC  =  D9; 
NET  ”RXP_data_P_3”  LOC  =  B6; 
NET  ’’RXN_data_P_3’’  LOC  =  A5; 
NET  ”RXN_data_P_4”  LOC  =  A 10; 
NET  ”RXP_data_P_4”  LOC  =  BIO; 
NET  ”RXP_data„P_5”  LOC  =  A4; 
NET  ”RXN_data_P_5”  LOC  =  A3; 
NET  ’’RXN_data_PJ>’’  LOC  -  A 12; 
NET  ”RXP_data.P_6”  LOC  =  B1 1; 
NET  ”RXP.data_P_7”  LOC  -  B4; 
NET  ’’RXN.data  J>_7”  LOC  =  B5; 
NET  ’’state.O”  LOC  =  F24; 

NET  ’’state.  1”  LOC  =  G25; 

NET  ’’led.bank.O”  LOC  =  H9; 

NET  ’’led.bank.l”  LOC  =  E26; 
NET  ”led_bank_2”  LOC  =  E25; 
NET  ”led_bank_3”  LOC  =  F25; 
NET  ”led.bank_4”  LOC  -  G26; 
NET  ”pb3Jn”  LOC  =  M22; 

NET  ”pb4  Jn”  LOC  =  N22; 

NET  ”pb5Jn”  LOC  =  N23; 

NET  ”pb6in”  LOC  =  N24; 


Key  parameters: 

(1) ,  Start  pattern  threshold  for  finding  frame’s  time  of  arrival,which  depends  on  ADC  bits  we  used  in  interface 
module,  currently  the  value  can  varies  form  3  to  1500  and  adjusted  in  variable  threshold  module. 

(2) ,  The  chip  threshold  ratio  for  chip  level  synchronization,  which  is  adjusted  in  variable  threshold  module  can  can 
change  from  0.375  to  0.75  of  maximum  integration  energy. 


3.3.  DAC  CONFIGURATION  FILES  AT  TRANSMITTER  SIDE 


21 


(3) ,  The  frame  level  synchronization  threshold  for  frame  level  synchronization,  which  defines  the  number  of  tolerent 
errors  in  60  bits,  currently  it  can  change  from  8  to  20  and  can  be  adjusted  in  variable  threshold  module. 

(4) ,  The  ADC  input  bias  shift,  which  defines  the  level  the  ADC  quantization,  currently  can  change  from  128  to  160 
and  can  be  adjusted  in  variable  threshold  module. 

(5) ,  the  ADC  bits  used,  this  value  is  defined  in  interface  module  such  as  ”‘data_0t  j=  RX_data[6:0];”’,  the  syntax 
means  we  are  currently  using  7  bits  of  the  ADC,  and  we  can  change  to  use  all  8  bits  by  ”‘data_0t  j=  RX_data[7:0];’”, 
and  other  7  similar  syntax  should  be  changed  accordingly. 

(6) ,  the  delay  cycles  of  the  beginning  of  synchronization  after  finding  the  time  of  arrival  which  defined  by  200ns 
dense  pulses.  The  actual  delay  should  be  2  chips  time,  but  the  delay  cannot  be  determined  because  of  the  uncertainly 
of  circuits’  delay,  currently  it  can  change  from  one  chips  time  to  3  chips  time  in  start  pattern  searching  module. 

(7) ,  change  between  4  chips  and  16  chips  system,  this  can  be  done  by  change  to  accumulate  16  chips  value  in  data 
process  module  and  to  increase  the  chip/value  threshold  to  4  times.some  other  control  signal  also  need  to  adjusted 
in  data  process  module. 

(8) ,  Integration  window,  the  system  is  capable  of  increasing  integration  window  from  10ns  for  time  reversal  system 
to  20  ns  for  pulse  system,  this  value  is  adjusted  in  integration  module. 

Notes:  more  details  can  be  found  in  file  ’’Tog.doc”’. 
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•  Fujitsu -config. 


Descriptions’  This  is  the  DAC  configuration  files  folder  in  transmitter. 

There  are  two  configuration  cases  for  DAC  in  the  system  ’s  demonstration,  the  first  one  is  to  use  DAC  mem¬ 
ory  to  transmit  waveform  for  observing  focus  in  receiver,  and  the  second  one  is  to  use  FPGA  to  transmit  data 
for  data  demodulation,  the  files  are  in  the  folder  of  ’’‘data’”.  Each  time  the  registers  are  configured,  the  file 
”‘MB86064^ab_on_forJvds_data.txt’”  need  to  be  downloaded  to  the  DAC  and  the  sampling  clock(500MHz)  is  pro¬ 
vided  by  FPGA. 

(1) ,  DAC  memory  case,  the  waveform  files  are  loaded  to  DAC  RAM  A  for  Q  channel  and  RAM  B  for  1  channel, 
the  files  format  should  be  ’vec\  there  are  two  kind  of  files,  the  time  reversal  file”’  ”cand  the  pulse  file”””,  actually 
there  are  many  other  waveforms  in  the  folder  such  as  ramps  and  sine,  after  RAMs  loaded,  the  RAMs  need  to  be 
synchronized,  and  MB 8 6064 ^ab.on _for_lvds_data.txt”’  file  need  to  be  loaded 

(2) ,  FPGA  case,  the  waveform  is  directly  form  FPGA  output,currently  we  are  using  8  bits  of  total  14  bits,  we  only 
need  to  load  MB 8 6064 ^ab_on_for _lvds_data.txt”’  file 
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Figure  4.1:  VNA  measurement  setup. 


4.1  VNA  Based  CIR  Measurement  and  Post  Data  Processing 


A  VNA  can  accurately  measure  the  channel  characteristics  with  large  bandwidth.  Since  the  Time  Reversal  waveform 
is  a  passband  signal,  it  can  be  generated  from  I/Q  modulating  a  pair  of  baseband  waveforms  with  a  carrier.  In  this 
project  we  use  the  VNA  and  post  signal  processing  to  estimate  the  CIR. 

According  to  the  testbed  specification,  the  VNA  frequency  is  set  to  sweep  from  3.5  GHz  to  4.5  GHz  to  cover  all 
frequencies  of  interest.  The  frequency  step  is  set  to  1  MHz  since  the  delay  spread  of  the  indoor  channel  is  less  than 
1  /is.  To  get  better  signal-to-noise  ratio  (SNR)  data,  the  number  of  averages  is  set  to  256.  The  system  setup  of  VNA 
measurement  is  depicted  in  Fig.  4.1 .  Two  50-feet  cables  with  SMA  connectors,  a  pair  of  antennas  and  a  30-dB  gain 
low -noise-amplifier  (LNA)  are  used  VNA  transmit  power  levels  at  the  two  ports  have  to  be  carefully  selected  to 
prevent  from  damaging  the  LNA.  The  “decoupled”  option  has  been  used  for  flexible  power  setting  in  forward  and 
reverse  directions.  Cable  loss  and  path  loss  together  contribute  an  attenuation  over  100  dB,  which  can  make  the 
receive  signal  at  the  VNA  port  too  weak  to  detect.  Thus  additional  LNA  may  be  necessary  to  compensate  the  losses. 
The  connections  must  be  tight  to  minimize  measurement  inaccuracy. 

The  VNA  must  be  calibrated  in  prior  to  actual  measurement.  VNA  calibration  eliminates  the  impairments  introduced 
by  the  connecting  parts  along  the  signal  chain  between  Port  1  and  Port  2,  so  that  the  measurement  data  solely 
reflects  the  characteristic  of  the  device  under  test  (DUT).  In  this  measurement  the  DUT  is  the  channel  include  the 
propagation  channel,  the  LNA  and  the  antennas.  However,  since  the  total  cable  loss  is  around  35  dB  at  4  GHz,  the 
VNA  calibration  is  not  able  to  compensate  such  amount  of  loss.  In  fact,  the  cable’s  characteristic  can  be  measured 
separately  and  the  cable’s  transfer  function  can  be  eliminated  in  off-line  post  signal  processing. 

The  total  filtering  effect  of  the  system’s  RF  front-ends  at  both  sides  should  be  taken  into  account.  It  has  been 
found  that  this  equivalent  filter  in  baseband  representation  is  dominated  by  the  D/A’s  interpolating  filer  which  is 
measurable.  The  measured  data  of  the  inteipolatmg  filter  has  been  combined  with  the  VNA  measurement  data  to 
make  the  CIR  estimate  more  accurate. 

After  measurement,  we  load  the  frequency  domain  data  into  a  Matlab  program  and  get  the  baseband  time  reversal 
I/Q  waveform.  The  VNA  data  is  denoted  as  C  [/],/;</  <  fh ,  with  center  frequency  /q.  We  shift  the  signal  to 
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Figure  4.2:  Environment  setup  for  DAC  test. 


baseband  and  get  C  [f  —  /o].  Then,  we  can  get  the  baseband  I/Q  waveform  according  to  the  following  equations: 


c[t\IFTC\f-f0} 
c  [t]  =  a[t]  +jb  [<] 

8  [£]  ==  Re  {c  [t]  ej27r^otJ  =  Re  {(a  [t]  +  jb  [f])  (cos  finfot]  +  j sin  [27r/o£])}  =  a  [<]  cos  [2nfot}  —  b  [t]  sin  [2n f0t) 


where  s[f]  is  the  modulated  signal,  whose  positive  frequency  is  the  conjugation  of  the  channel  transfer  function.  So, 
we  have  baseband  I/Q  waveform  as: 


/  [<]  =  a  [<] 

Q  [<]  =  ~b  [<] 


This  baseband  I/Q  waveform  is  loaded  into  D/A’s  memory  to  test  whether  a  desired  signal  can  be  generated.  An 
experiment  in  a  metallic  box  was  conducted  for  this  verification  purpose.  The  layout  of  the  experiment  is  illustrated 
in  Fig.  4.2.  A  transmit  antenna  and  a  receive  antenna  are  blocked  by  a  metal  board.  The  whole  box  serves  as  a 
confined  propagation  environment.  The  magnitude  of  a  CIR  measured  in  the  box  is  depicted  in  Fig.  4.1,  and  the 
quantized  I/Q  waveform  is  shown  in  Fig.  4.4  as  well  as  Fig.  4.5.  The  received  waveform  at  the  detector  is  observed 
with  an  oscilloscope.  According  to  time  reversal  phenomenon,  one  strong  envelope  peak  should  be  observed.  Fig. 
4.6  shows  an  incorrect  result.  This  is  due  to  the  unmatched  I/Q  waveform  caused  by  a  50  MHz  LO  shift.  Fig.  4.7 
shows  the  result  with  matched  transmit  waveform.  At  this  point  we  can  say  that  the  acquired  I/Q  waveforms  and  the 
system  work  properly. 

Other  issues  to  consider  include  the  waveform  length  truncation  and  vertical  clipping  due  to  the  implementation 
limits  in  the  digital  FIR  filter.  For  time  reversal  precoding,  although  the  actual  length  of  CIR  may  excess  160  ns,  the 
testbed  supports  waveforms  of  length  160  ns  (160  points)  at  1-Gsps  sampling  rate.  To  maximize  the  performance 
under  this  length  limit,  a  160-ns  long  sliced  waveform  with  maximum  energy  is  selected.  It  is  reasonable  to  set  a 
160-ns  length  limit  because  the  delay  spread  in  a  typical  indoor  environment  is  less  than  160  ns.  The  waveforms 
can  be  overlapped  as  the  chip  duration  is  40  ns,  less  than  160  ns.  Vertical  clipping  may  occur  since  the  number  of 
quantization  bits  is  limited.  Chapter  2.4  has  provided  a  phase  rotation  method  to  maximally  utilize  the  quantization 
ability  and  avoid  clipping.  As  will  be  discussed  in  the  system  trail  section,  minor  clipping  is  tolerable  in  time  reversal 
precoding.  In  other  words,  we  can  make  phase  rotation  solely  based  on  a  single  chip  waveform,  without  considering 
the  impact  of  chip  waveform  overlapping.  This  simplifies  the  process  but  usually  leads  to  some  insignificant  clipping 
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Figure  4.3:  Envelope  of  the  channel  impulse  response  in  the  metallic  box. 


Figure  4.4:  Quantized  I  channel. 
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Figure  4.5:  Quantized  Q  channel. 
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Figure  4.6:  Received  waveform  with  50  MHz  frequency  offset  of  the  transmit  waveform  caused  by  LO. 
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Figure  4.7:  Received  waveform  with  matched  transmit  waveform.  Measured  with  DSO  at  the  detector. 
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4.1,1  Main  Files  for  Channel  Measurement  and  Post  Data  Processing 

All  files  are  placed  under  the  same  folder 
Input  files: 

•  CHANNEL.cti 

Description:  This  file  stores  the  S21  parameter  of  the  channel.  It  is  the  data  file  obtained  from  VNA  measurement. 
‘CHANNEL’  is  user  defined 

•  cableJ3_5.cti 

Description:  This  is  the  S21  parameter  of  two  50  ft  SMA  cables  connecting  together. 

•  DAC_lns.mat 

Description:  This  is  the  time  domain  1  ns  pulse  generated  by  DAC.  It  is  captured  by  DPO. 

Main  processing  file" 

•  Load_FIR.Coef_VNA.rn 

Description:  This  is  the  main  file  for  post  data  processing.  Input  files  are  loaded  and  several  output  files  for  FPGA 
and  DAC  will  be  generated. 

Output  files 

•  Xpt_lJ_FPGA.txt,  Xpt_l_Q_FPGA.txt, 

Description:  These  are  the  I/Q  waveform  template  files  loaded  in  FPGA.  X  is  the  number  of  points  for  the  truncated 
CIR. 


•  FILENAME J.vec,  FILENAME.Q.vec 


Description:  These  are  the  I/Q  waveform  template  files  loaded  in  DAC.  FILENAME  is  the  filename  of  the  data  file 
storing  S21  parameters. 
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5.1  Introduction 


Motivated  by  increasing  demand  for  cheap  wireless  sensor  networks,  a  wideband  radio  system  combining  waveform 
precoding  and  simple  receivers  is  considered  in  this  paper.  Among  receiver  options  is  energy  detector  receiver  that 
is  attractive  because  of  its  compromising  between  performance  and  simplicity.  The  central  idea  is  to  pair  the  simple 
receiver  with  a  sophisticated  transmitter  which  is  able  to  transmit  an  optimal  waveform  to  achieve  performance  gain. 
Such  type  of  transmitter  featured  by  an  arbitrary  waveform  generator  has  been  tested  in  our  laboratorial  UWB  test¬ 
bed,  thanks  to  the  recent  technology  advanced  in  semiconductors.  Although  multiple  of  Nyquist  sampling  rate  has 
been  feasible  for  Giga-Hertz  bandwidth,  reduction  in  quantization  and  transmitted  peak  power  is  very  desired  from 
implementation  perspective. 

Four  practical  cases  are  considered  in  this  paper,  which  are  all  critical  issues  when  we  built  our  laboratorial  UWB 
test-bed.  There  should  be  a  tradeoff  between  eneigies  within  and  outside  integration  window  if  inter-symbol- 
interference  (I SI)  has  to  be  concerned.  Because  of  the  hardware  limitation  or  implementation  simplicity,  a  low- 
resolution  quantization  has  to  be  employed  sometimes.  Thus  the  design  for  binary  waveform  and  ternary  waveform 
are  studied.  Meanwhile,  binary  waveform  or  ternary  waveform  can  be  treated  as  one  kind  of  robust  waveform,  if 
channel  estimation  is  not  accurate.  For  waveform  design  in  the  digital  domain,  one  of  the  challenging  issues  is 
Peak-to-Average  Power  Ratio  (PAPR)  because  of  the  nonlinear  devices  used  in  the  system.  The  transmitted  peak 
power  should  be  reduced  such  that  the  nonlinear  devices  can  work  in  the  proper  region.  All  the  practical  cases 
are  formulated  as  the  corresponding  optimization  problems.  By  using  the  advanced  optimization  tool,  the  optimal 
solution  or  suboptimal  solution  to  these  optimization  problems  can  be  achieved.  The  work  in  this  papier  can  not  only 
make  the  contribution  to  the  theoretical  research  in  waveform  optimization  and  synthesis  but  also  give  the  guidance 
for  the  real  system  design  and  implementation. 

The  rest  of  the  paper  is  organized  as  follows.  The  system  is  described  in  Section  II.  Practical  considerations  for 
wideband  waveform  design  are  presented  in  Section  III.  Numerical  results  are  provided  in  Section  IV,  followed  by 
some  remarks  given  in  Section  V. 


5.2  System  Description  and  Optimal  Waveform 


The  system  architecture  is  shown  in  Figure  5.1 .  We  limit  our  discussion  to  a  single-user  scenario,  and  consider  the 
transmitted  signal  with  OOK  modulation  given  by 


oo 

*(<)“  53  djP{t-jTb ) 

j=-oo 


(5.1) 


where  Tb  is  the  symbol  duration,  p(t)  is  the  transmitted  symbol  waveform  defined  over  [0,  Tp]  and  d  6  {0, 1}  is 
j-th  transmitted  bit.  Without  loss  of  generality,  assume  the  minimal  propagation  delay  is  equal  to  zero.  The  energy 
of  p(t)  is  Ep, 


dt  =  Ev 


(5.2) 
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Figure  5.1;  System  architecture. 

The  received  noise-polluted  signal  at  the  output  of  the  receiver  front-end  filter  is 

r(t)  =  /i(Z)<g)  s(t)  +n(t) 

oo 

=  ^  djx(t-jTb)  +  n(t),  (5.3) 

j  =  —  OO 

where  h  (t) ,  t  €  [0,  T/J  is  the  multipath  impulse  response  that  takes  into  account  the  effect  of  channel  impulse 
response,  the  RF  front-ends  in  the  transceivers  including  antennas.  h(t)  is  available  at  the  transmitter.  “(8)”  denotes 
convolution  operation.  n(t)  is  a  low-pass  additive  zero-mean  Gaussian  noise  with  one-sided  bandwidth  W  and 
one-sided  power  spectral  density  :r(<)  is  the  received  noiseless  symbol-‘T’  waveform  defined  as 

x(t)  =  h(t)  ®p(t)  (5.4) 


def , 


We  further  assume  that  Tb  >  +  Tp  —  Tx,  i.e.  no  existence  of  ISI. 

An  energy  detector  performs  square  operation  to  r(t).  Then  the  integrator  does  the  integration  over  a  given  integra¬ 
tion  window  T/.  Corresponding  to  the  time  index  A;,  the  fc-th  decision  statistic  at  the  output  of  the  integrator  is  given 
by 


=/: 

=jC 


rfcTb-fT/o+T/ 

Zk=  I  r2(t)dt 

//cTb+T/o 

fkTb+Tio+Tj 


(d^x(t  —  kTb)  +  n(t))2dt 

JkTb+TI() 

where  Tjq  is  the  starting  time  of  integration  for  each  symbol  and  0  <  Tjq  <  Tjq  +  Tj  <  Tx  <  Tb. 


(5.5) 

(5.6) 


An  approximately  equivalent  SNR  for  the  energy  detector  receiver,  which  provides  the  same  detection  performance 
when  applied  to  a  coherent  receiver,  is  given  as 


SNR^q  = 


2.37/ W7 Nfi  +  N0  ///0nfT'  x2  ( t )  dt 


(5.7) 


For  best  performance,  the  equivalent  SNR  SNR^q  should  be  maximized.  Define, 

rTia+Ti 


Ej 


r*i<] 
JTj  o 


x2  (t)  dt 


(5.8) 
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For  given  Tj,  Nq  and  Wy  SNR^q  is  the  increasing  function  of  Ej.  So  the  maximization  of  SNR^q  in  Equation  5.7  is 
equvalent  to  the  maximization  of  Ej  in  Equation  5.8. 


So  the  optimization  problem  to  get  the  optimal  p  is  shown  below. 


max  +  Tl  x2  (i)  dt 

s  t-  Jo7’’  P2  (0  dt  =  Ev 


(5.9) 


In  order  to  solve  the  optimization  problem  5.9,  numerical  approach  is  employed  in  this  paper.  In  other  words,  p(<), 
h(t)  and  z(<)  are  uniformly  sampled  (assumed  at  Nyquist  rate),  and  the  optimization  problem  5.9  will  be  converted 
to  its  corresponding  discrete-time  form.  Assume  the  sampling  period  is  T8.  Tp/T9  =  Npy  Th/T9  =  Nh  and 
Tx/Ts  =  Nx.  So  NX  =  NP  +  Nh. 

p(t),  h(t)  and  x(t )  are  represented  by  pt-,  i  =  0, 1, . . . ,  Np>  h{,  i  =  0, 1, . . . ,  Nh  and  xt,  t  =  0, 1, ,  Nx  respectively. 


Define, 


and 


P  =  [po  Pi  •  •  •  Pnp]'j 
x  =  [x0  XI  •••  xNl}7 


(5.10) 

(5.11) 


Construct  channel  matrix  H(^;r+1)><(^p+1), 

no.,- = { 


j,0  <i-j  <  Nh 
else 


(5.12) 


where  (•),—  denotes  the  entry  in  the  i- th  row  and  j-th  column  of  the  matrix  or  vector.  Meanwhile,  for  vector,  taking 
p  as  an  example,  (p)t  x  is  equivalent  to  pi-\. 


The  matrix  expression  of  Equation  5.4  is, 


(5.13) 


x  =  Hp 

and  the  constraint  in  the  optimization  problem  5.9  can  be  expressed  as, 

||p|| \Ts  =  Ep  (5.14) 

where  denotes  the  norm-2  of  the  vector.  In  order  to  make  the  whole  paper  consistent,  we  further  assume, 


IIpIIj  =  1 


(5.15) 


Let  Tj/T9  =  Nj  and  Tjq/T8  =  Njq.  The  entries  in  x  within  integration  window  constitute  X/  as, 

X/  =  [i/v,„  xNin+i  ■  ■  ■  xNia+Nl]T  (5.16) 

and  Ei  in  Equation  5.8  can  be  equivalently  shown  as, 

Ej  =  ||x/  H2  T,  (5.17) 


Simply  dropping  T9  in  Ej  will  not  affect  the  optimization  objective,  so  Ej  is  redefined  as. 


El  =  Ml 


(5.18) 
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Similar  to  Equation  5.13,  x/  can  be  obtained  by, 

x/  =  H/p 

where  (H /)fj-  =  (H) Nl0+ij  and  t  =  1, 2, . . . ,  Ni  + 1  as  well  as  j  =  1, 2, . . . ,  Np  +  1. 

The  optimization  problem  5.9  can  be  represented  by  its  discrete-time  form  as, 

max  Ei 

ipiii  =  i 

The  optimal  solution  p*  for  the  optimization  problem  5.20  is  the  dominant  eigen-vector  in  the  following  eigen¬ 
function, 

H/  H/p  =  Ap  (5.21) 

Furthermore,  E]  will  be  obtained  by  Equation  5.18  and  Equation  5.19. 


(5.19) 


(5.20) 


53  Waveform  Design  with  Practical  Considerations 

53.1  Tradeoff  between  Energies  Within  And  Outside  Integration  Window 

The  energy  outside  the  integration  window  needs  to  be  concerned  sometimes,  say,  when  ISI  has  to  be  considered.  In 
order  to  reduce  ISI,  the  energies  within  and  outside  integration  window  should  be  balanced,  which  means  the  energy 
within  integration  window  should  be  maximized  rind  the  energy  outside  integration  window  should  be  minimized. 

The  entries  in  x  outside  integration  window  constitute  x/  as, 

X/  =  [*0  •  ■  •  arjV/o  ,  XN,„+N,+ 1  *  •  *  xNx]T  (5.22) 

and  the  energy  outside  integration  window  Ej  can  be  expressed  as, 

Ej  =  ||x/||i  (5.23) 

Similar  to  Equation  5. 19,  x/  can  be  obtained  by, 

Xj  =  H,p  (5.24) 

where  (H/)fj.  =  (H)fj-  when  i  =  1, . . . ,  NI0  and  (H/),._(JV;+1)j.  =  (H)4j.  when  i  =  NI0  +  Nj  +  2, . . . ,  Nx  +  1 
as  well  as  j  =  1,2, . . .  ,Np  4- 1. 

In  order  to  balance  energies  within  and  outside  integration  window,  the  tradeoff  factor  a  is  introduced.  The  range  of 
a  is  from  0  to  1.  Given  a,  the  optimization  problems  is  formulated  as  follows, 

max  aPJ j  —  (1  —  a)Ef 

o  /  1  (5.25) 

s.t.  ||p||2  =  1 

The  optimal  solution  p*  for  the  optimization  problem  5.25  is  the  dominant  eigen-vector  in  the  following  eigen¬ 
function, 

[aHfH/  -  (1  -  a)HjH/]  p  =  Ap 


(5.26) 
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5.3.2  Binary  Waveform 

If  the  transmitted  waveform  is  constrained  to  the  binaiy  waveform  because  of  the  hardware  limitation  or  implemen¬ 
tation  simplicity,  which  means  pt-,  i  =  0, 1, ,  Np  is  equal  to - t1  or 

is  expressed  as, 


=,  then  the  optimization  problem 


max  Ei 

s-*.[(p)i,l]2  “  i-iTVp  ’  *  =  1j  •  *  • » Np 


(5.27) 


One  suboptimal  solution  to  the  optimization  problem  5.27  is  derived  from  the  optimal  solution  p*  of  the  opti¬ 
mization  problem  5.20.  When  p*  is  obtained,  then 


(p.,  J  7^'(p')"'ao 

'''  \  “7i7j<;'<p‘)«<0 


(5.28) 


This  simple  method  can  lead  to  the  optimal  solution  to  the  optimization  problem  5.27  when  Tj  — ►  0,  which  can  be 
proofed  by  CauchySchwarz  inequality,  but  if  Tj  is  greater  than  zero,  there  is  still  a  improvement  potential  to  this 
suboptimal  solution  obtained  from  Equation  5.28. 

It  is  well  known  that  the  optimization  problem  5.27  is  Quadratically  Constrained  Quadratic  Program  (QCQP)  and 
general  QCQP  is  NP-hard,  so  a  semidefmite  relaxation  method  is  proposed  to  give  the  suboptimal  solution  to  this 
optimization  problem. 

Define, 

p  =  PP'  (5  29) 

P  should  be  a  symmetric  positive  semidefmite  matrix,  i  e.  P  X=  0  and  rank  of  P  should  be  equal  to  1.  Reformulate 
Ei  as, 


Ei  =  p7H|H/P 


=  trace  (H7  H/pp‘  ) 
=  trace  (HjTH/P) 


(5.30) 

(5.31) 

(5.32) 


Rank  constraint  is  nonconvex  constraint,  so  after  dropping  it,  QCQP  is  relaxed  to  the  Semidefimte  Program  (SDP), 


max  trace  (Hj  H/P) 
s.t.  (P)T*  t*  =  Y+Np  ’  *  ==  0,1,...,  Np 
P  >-=  0 


(5.33) 


The  optimal  solution  P*  of  the  optimization  problem  5.33  can  be  obtained  by  using  CVX  tool  from  Stanford  Univer¬ 
sity  and  the  value  of  the  objective  function  in  the  optimizatiion  problem  5.33  gives  the  upper  bound  of  the  optimal 


value  in  the  optimization  problem  5.27.  Project  the  dominant  eigen -vector  of  P*  on  — 


and 


based 


y/i+Np  \Z1+Arp 

on  Equation  5.28,  the  suboptimal  solution  p£2  is  achieved  using  the  method  proposed  in  Semidefinite  Optimization 
with  Applications  in  Sparse  Multivariate  Statistics. 


Finally,  the  designed  binaiy  waveform  is. 


pi  =  arg  max  p^H^H/p 

l*{PfcpP62} 


(5.34) 
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5.3.3  Ternary  Waveform 


If  the  transmitted  waveform  is  constrained  to  the  ternary  waveform,  which  means  p„  i  =  0, 1, . . . ,  Np  is  equal  to 
three  levels,  i.e.  — c,  0  or  c,  then  the  optimization  problem  is  expressed  as, 

max  Ei 

s.t.[(Pki}2  =  orO,  i  =  0,l,...,Np  (5.35) 

Up  II 2  =  i 

where  the  value  of  c  will  be  determined  later. 


The  optimization  problem  5.35  is  still  NP-hard  and  can  be  approximately  reformulated  as, 

max  Ej 

,s. £. Cardinality  (p)  <  k  (5.36) 

1  <  k  <  Np  +  1  HpHI  =  1 

where  Cardinality  (p)  denotes  the  number  of  non-zero  entries  of  p  and  cardinality  constraint  is  also  a  nonconvex 
constraint. 


Because  k  is  the  integer  number  between  1  and  Np  -I-  1,  the  optimization  problem  5.36  can  be  decomposed  into 
Np  4*  1  independent  and  parallel  sub-problems  and  each  sub-problem  is  shown  as, 

max  Ej 

s.t. Cardinality  (p)  <  k  (5.37) 

Up  II 2  =  i 

where  k  is  equal  to  1, 2,  *  *  * ,  or  Np  -f  1; 


The  sub-problems  5.37  can  be  solved  in  parallel  and  then  the  solutions  are  combined  to  get  the  solution  of  the 
original  optimization  problem  5.35  Reuse  the  definition  m  Equation  5.29  and  the  sub  problem  5.37  can  converted 
to  the  following  SDP  by  semidefinite  relaxation  combined  with  11  heuristic  proposed  in  Semidefinite  Optimization 
with  Applications  in  Sparse  Multivariate  Statistics. 


max  trace  (Hj  H/P) 
.si.  trace  (P)  =  1 
ar  |P|  a  <  k 
Py=0 


(5.38) 


where  a  is  all- 1  column  vector  and, 


IIpIII  =  p7p 

(5.39) 

=  trace  (pp7 ) 

(5.40) 

=  trace  (P) 

(5.41) 

The  CVX  tool  from  Stanford  University  is  also  operated  to  get  the  optimal  solution  P£  of  SDP  5.38.  From  the 
dominant  eigen -vector  p£  of  P*k  and  the  threshold  pthk>  solution  for  the  sub-problem  5.37  can  be  achieved  as, 

Ck,  (Pfc)f,l  >  Pthk 
0,  j(Pfc)t>i  |  <  Pthk 
.  ~cki  (Pfc),,!  <~  Pthk 


(Ptk)i,l  =  < 


(5.42) 
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where 


and 


Pthk  =  arg  max  (p*k)  Hj  H/p*k 

{Pth} 

s.t. Cardinality  (p*k)  <  k 


Ck  = 


yj Cardinality  (p*k) 


(5.43) 

(5  44) 


Finally,  the  designed  ternary  waveform  is, 

Pz  =  are  max  pJH/H/p 

Pe{P;k^i,2,...,/vp+i} 


(5.45) 


5.3.4  Peak-to-Average  Power  Ratio 

PAPR  is  one  of  major  concerns  in  waveform  design.  Because  of  nonlinearity  caused  by  nonlinear  devices  such  as 
Digital -to- Analog  Converter  (DAC)  and  Power  Amplifier  (PA),  maximal  transmitted  power  has  to  be  backed  up, 
resulting  in  inefficient  utilization.  PAPR  in  OFDM  has  been  well  studied.  In  this  paper,  PAPR  is  handled  under  a 
unified  optimization  framework.  It  is  defined  as. 


PAPR  = 


IIpIIL 

IIpIII/ (wP+  i) 


(5.46) 


where 

IIpIIoo  =  max  (IpoI  >|pi|.---  ,|pnp|)  (5.47) 

If  ||p||2  is  given,  reducing  PAPR  is  equivalent  to  setting  the  upper  bound  for  HpII^.  So  the  optimization  problem 
can  be  expressed  as, 

max  Ej 

S.t.  Ml  =  1  (5.48) 

IIpIIoo  ^  ub 

The  bound  constraint  HpIIqq  £  ub  can  also  be  written  as, 

— ub  <  pi  <  ub,  i  =  0, 1, . . . ,  Np  (5.49) 

which  can  be  further  simplified  as, 

P,2<  (ub)2,i  =  0,l,...,(Vp  (5.50) 

Reuse  the  definition  in  Equation  5.29,  the  optimization  problem  5.46  can  be  relaxed  to  SDP, 

max  trace  (H  J  H /P) 
s.t.  (P)t  t-  <  (ub)2 ,2  =  0,1,...,  Np 
trace  (P)  =  1 
P  >-=  0 


(5.51) 
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Figure  5.2:  Multhpath  impulse  response. 

By  CVX  tool  from  Stanford  University,  the  optimal  solution  P*  of  the  optimization  problem  5.5 1  is  obtained.  If  the 
rank  of  P*  is  equal  to  1,  then  the  dominant  eigen-vector  of  P*  will  be  the  optimal  solution  p*  for  the  optimization 
problem  5.48.  But  if  the  rank  of  P*  is  not  equal  to  1,  the  dominant  eigen-vector  of  P*  can  not  be  treated  as  the 
optimal  solution  for  the  optimization  problem  5.48,  because  of  the  violation  of  bound  constraint. 


So  a  computationally-efficient  iterative  algorithm  is  proposed  to  get  the  suboptimal  solution  p*  to  the  optimization 
problem  5.48  as  follows. 

1.  Initialization:  P  =  1,  Ho  =  H/  and  p*  is  set  to  be  all-0  column  vector. 

2.  Solve  the  following  optimization  problem  to  get  the  optimal  q. 


max  q;  Hoq 

s.t.  Ml  =  p 


(5.52) 


3.  Find  i,  such  that  |qrt-|  is  the  maximal  value  in  the  set  {|</y|  \\qj\  >  ub}.  If  {i}  =  0,  then  the  algorithm  is  terminated 
and  p*  :=  p*  +  q.  Otherwise  go  to  step  4. 

4.  If  qi  is  greater  than  zero,  then  (p*)t-  x  is  set  to  be  ub.  Otherwise  (p*)^  is  set  to  be  — ub. 

5.  P  :=  P  —  (ub)2  and  set  (H0)tJ- ,  j  —  1, 2, . . . ,  Np  -+■  1  and  (H0)yt- ,  j  =  1, 2, . . . ,  Np  +  1  all  to  zeros.  Go  to  step 
2. 


5,4  Numerical  Results 

Figure  5.2  shows  the  multipath  impulse  response  h(t)  under  investigation  in  this  paper  and  the  energy  of  h(t)  is 
normalized.  T8  =  0.5ns,  Th  =  100ns,  Tp  =  100ns  and  7>o  +  =  100ns. 

Figure  5.3  shows  the  energy  Ej  in  the  integration  window  defined  by  Equation  5.18  when  different  tradeoff  factor  a's 
are  chosen.  Figure  5.4  shows  the  the  corresponding  ratio  of  energies  Ej/Ej  within  and  outside  integration  window. 
When  a  is  close  to  1,  the  eneigy  within  integration  window  will  have  more  weight.  The  optimization  objective  is 
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Figure  5.3:  The  energy  within  the  integration  window. 


Figure  5.4:  The  ratio  of  energies  within  and  outside  integration  window. 

to  maximize  the  energy  within  integration  window.  Thus  the  larger  a,  the  more  energy  within  integration  window. 
However,  when  a  is  close  to  0,  the  optimization  objective  will  be  to  minimize  the  energy  outside  integration  window. 
From  Figure  5.4,  the  trend  is,  the  smaller  a,  the  bigger  the  ratio  of  energies  within  and  outside  integration  window. 
This  results  give  us  the  hint  to  design  the  system  with  proper  parameters  if  ISI  is  introduced. 

For  PAPR,  if  Tj  — ►  0  and  ub  is  set  to  be  0.4,  the  designed  waveforms  are  shown  in  Figure  5.5.  In  this  case,  the 
optimal  waveform  without  consideration  of  PAPR,  the  waveform  obtained  by  SDP  and  the  waveform  achieved  by 
iterative  algorithm  are  the  same.  The  maximal  eigen-value  of  P*  in  the  optimization  problem  5.51  approaches  1, 
which  means  the  rank  of  P*  is  1.  So  waveform  obtained  by  SDP  will  not  violate  the  bound  constraint  Meanwhile, 
if  Tj  — ►  0  and  ub  is  not  tight  the  optimal  waveform  is  the  time  reversed  multipath  impulse  response,  if  Tj  — ►  0  and 
ub  is  set  to  be  0.15,  the  designed  waveforms  are  shown  in  Figure  5.6.  The  optimal  waveform  without  consideration 
of  PAPR  is  still  the  time  reversed  multipath  impulse  response.  The  waveform  obtained  by  SDP  is  equivalent  to  that 
achieved  by  iterative  algorithm,  which  means  both  of  two  methods  give  the  optimal  solution  under  PAPR  constraint. 

If  Tj  is  equal  to  2 ns  and  ub  is  set  to  be  0.15,  the  designed  waveforms  are  shown  in  Figure  5.7.  Because  the  rank 
of  P*  in  the  optimization  problem  5.51  is  equal  to  1,  the  waveform  obtained  by  SDP  gives  the  optimal  solution. 
Meanwhile  the  waveform  achieved  by  iterative  algorithm  gives  the  suboptimal  solution  and  the  ratio  of  energies 
within  the  integration  window  by  the  suboptimal  waveform  and  the  optimal  waveform  with  PAPR  is  0.99.  If  ub  is 
set  to  be  0.1,  the  designed  waveforms  are  shown  in  Figure  5.8.  The  rank  of  P*  in  the  optimization  problem  5.51  is 
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Figure  5.5:  The  designed  waveforms  if  Jj  — ►  0  and  ub  is  set  to  be  0.4. 


Figure  5.6:  The  designed  waveforms  if  Tj  — ►  0  and  ub  is  set  to  be  0.15. 

equal  to  2  in  this  case,  so  the  waveform  obtained  by  SDP  does  not  give  the  solution.  It  is  easy  to  see  from  Figure  5.8 
that  this  waveform  violates  the  bound  constraints  in  some  samples. 

In  order  to  compare  the  performance  of  optimal  waveform,  binary  waveform  and  ternary  waveform,  the  down- 
sampled  multipath  impulse  response  in  Figure  5.2  is  used.  Figure  5.9  shows  the  energies  when  optimal  waveform, 
binary  waveform  and  ternary  waveform  are  employed.  Because  of  SDP  relaxation,  the  proposed  algorithms  to  get  bi¬ 
nary  waveform  or  ternary  waveform  can  not  guarantee  optimum  from  energy’s  point  of  view  all  the  times.  However, 
energy  of  the  suboptimal  binary  or  ternary  waveform  approaches  that  of  the  optimal  binary  or  ternary  waveform  very 
well.  Take  this  case  as  an  example,  the  suboptimal  ternary  waveform  catches  at  least  94  percent  energy  compared  the 
optimal  ternary  waveform,  while  for  binary  waveform,  this  number  is  well  above  0.99.  Meanwhile,  ternary  wave¬ 
form  can  obtain  more  than  80  percent  energy  from  the  optimal  waveform,  which  makes  ternary  waveform  competent 
in  this  kind  of  system  with  energy  detector  receiver. 


5.5  Conclusion 


Wideband  waveform  optimization  with  energy  detector  receiver  has  been  studied  in  this  paper.  This  work  is  a  part 
of  our  effort  in  searching  for  simple-receiver  solutions  with  enhanced  performance.  The  contribution  of  this  paper 
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Figure  5.7:  The  designed  waveforms  if  Tj  is  equal  to  2ns  and  ub  is  set  to  be  0.15. 


Figure  5.8:  The  designed  waveforms  if  7/  is  equal  to  2ns  and  ub  is  set  to  be  0.1. 

is  to  combine  waveform  design  and  optimization  with  the  practical  situations.  Four  practical  cases  are  considered 
in  this  paper,  i.e.  the  tradeoff  between  energies  within  and  outside  integration  window,  binary  waveform,  ternary 
waveform  and  PAPR.  The  methods  and  results  of  this  paper  can  give  us  the  guidance  for  the  real  system  design  and 
implementation. 
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Figure  5.9:  The  energies  when  optimal  waveform,  binary  waveform  and  ternary  waveform  are  employed. 


